Photo Semantic Understanding and Retargeting by a Noise-Robust Regularized Topic Model
نویسندگان
چکیده
Retargeting aims at displaying a photo with an arbitrary aspect ratio, wherein the visually/semantically prominent objects are appropriately preserved and visual distortions can be well alleviated. Conventional retargeting models built upon perception of photos from family pre-specified communities ( e.g. , “portrait”), underlying community-specific features not learned explicitly. Thus they cannot retarget aerial photos, which contains rich variety different scales. In this work, novel framework is designed by encoding deep automatically detected Google Maps into regularized probabilistic model. Specifically, we first propose enhanced matrix factorization (MF) algorithm to calculate based on million-scale pictures, for each feature simultaneously. The MF incorporates label denoising, between-communities correlation, collaboratively. Subsequently, model called LTM that quantifies spatial layouts multiple in hidden space. To alleviate overfitting imbalanced numbers regularizer added LTM. Finally, leveraging LTM, shrink test horizontially/vertically maximize posterior probability retargted photo. Comprehensive subjective evaluations visualizations have demonstrated advantages our method. Besides, competitively consistent ground truth, according quantitative comparisons 2M photos.
منابع مشابه
SR-clustering: Semantic regularized clustering for egocentric photo streams segmentation
While wearable cameras are becoming increasingly popular, locating relevant information in large unstructured collections of egocentric images is still a tedious and time consuming process. This paper addresses the problem of organizing egocentric photo streams acquired by a wearable camera into semantically meaningful segments, hence making an important step towards the goal of automatically a...
متن کاملUnderstanding the semantic principles of a political map
The attempt to recognize phenomena and affairs has always been a concern of the human mind and has constantly sought to complete this knowledge. The correct recognition is also achieved when the real nature of phenomena is clear to man. The phenomena are based on their own philosophical foundations and, therefore, their understanding requires perception these philosophical foundations and using...
متن کاملSemantic Understanding and Commonsense Reasoning in an Adaptive Photo Agent
In a story telling authoring task, an author often wants to set up meaningful connections between different media, such as between a text and photographs. To facilitate this task, it is helpful to have a software agent dynamically adapt the presentation of a media database to the user's authoring activities, and look for opportunities for annotation and retrieval. Expecting the user to manually...
متن کاملImproving Topic Coherence with Regularized Topic Models
Topic models have the potential to improve search and browsing by extracting useful semantic themes from web pages and other text documents. When learned topics are coherent and interpretable, they can be valuable for faceted browsing, results set diversity analysis, and document retrieval. However, when dealing with small collections or noisy text (e.g. web search result snippets or blog posts...
متن کاملA Regularized Latent Semantic Indexing: A New Approach to Large Scale Topic Modeling
Topic modeling provides a powerful way to analyze the content of a collection of documents. It has become a popular tool in research areas such as text mining, information retrieval, natural language processing, and other related fields. In realworld applications, however, the usefulness of topic modeling is limited due to scalability issues. Scaling to larger document collections via paralleli...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
سال: 2023
ISSN: ['2151-1535', '1939-1404']
DOI: https://doi.org/10.1109/jstars.2023.3247745